Fix inconsistent `typeSize` calculation for `TupleN` vs recursive pair encodings #24743

Lluc24 · 2025-12-13T11:23:54Z

Currently, typeSize reports inconsistent values for standard tuple types (e.g., (A, B)) compared to their semantically equivalent recursive pair encodings (e.g., A *: B *: EmptyTuple).

This discrepancy arises because TupleN is represented as a flat AppliedType, whereas the nested encoding forms a deeper tree structure. As typeSize is often used as a heuristic for complexity or optimization limits, this inconsistency can lead to divergent behavior in the compiler depending on how a tuple is represented syntactically.

This PR modifies TypeSizeAccumulator to canonicalize TupleN types into their recursive *: representation before calculating their size. This ensures that the size metric is consistent regardless of whether the tuple is represented as a flat AppliedType or a nested structural type.

I have added a new unit test in TypesTest that asserts Tuple3[Int, Boolean, Double] and Int *: Boolean *: Double *: EmptyTuple both yield identical typeSize equal to 3.

Thanks to @mbovel for providing the unit test that effectively reproduces this issue and validates the fix.

Fixes #24730

mbovel · 2025-12-14T19:18:52Z

We could maybe use normalizedTupleType here:

scala3/compiler/src/dotty/tools/dotc/core/TypeUtils.scala

Lines 148 to 157 in f67a1a0

    
               /** If this is a generic tuple type with arity <= MaxTupleArity, return the 
        
                *  corresponding TupleN type, otherwise return this. 
        
                */ 
        
               def normalizedTupleType(using Context): Type = 
        
                 if self.isGenericTuple then 
        
                   self.tupleElementTypes match 
        
                     case Some(elems) if elems.size <= Definitions.MaxTupleArity => defn.tupleType(elems) 
        
                     case _ => self 
        
                 else 
        
                   self

mbovel · 2025-12-14T19:25:02Z

compiler/src/dotty/tools/dotc/core/Types.scala

          val tpNorm = tp.tryNormalize
          if tpNorm.exists then apply(n, tpNorm)


Suggested change

val tpNorm = tp.tryNormalize

if tpNorm.exists then apply(n, tpNorm)

val tpNorm = tp.normalized.normalizedTupleType

if tpNorm ne tp then apply(n, tpNorm)

normalizedTupleType does the opposite of what we want. We need to convert TupleN to *: equivalent. I found this function in TypeUtils.scala that does exactly this:

scala3/compiler/src/dotty/tools/dotc/core/TypeUtils.scala

Lines 122 to 126 in c82b623

/** The `*:` equivalent of an instance of a Tuple class */

def toNestedPairs(using Context): Type =

tupleElementTypes match

case Some(types) => TypeOps.nestedPairs(types)

case None => throw new AssertionError("not a tuple")

Also note that tp.normalized on TupleN returns NoType.

Using this function, the code is cleaner and more readable:

def apply(n: Int, tp: Type): Int = tp match { case tp: AppliedType if defn.isTupleNType(tp) => foldOver(n + 1, tp.toNestedPairs) // From here the following code is the same as the original case tp: AppliedType => val tpNorm = tp.tryNormalize if tpNorm.exists then apply(n, tpNorm) else foldOver(n + 1, tp) ...

We need to convert TupleN to *: equivalent.

Why not do the opposite? TupleN needs less object allocations.

Also note that tp.normalized on TupleN returns NoType.

Maybe the following would work?

tp match { case tp: AppliedType => val tpNorm = tp.tryNormalize if tpNorm.exists then apply(n, tpNorm.normalizedTupleType) else foldOver(n + 1, tp.normalizedTupleType)

The above has the advantage that it always normalizes the type, even when it's a tuple.

The motivation of this issue is that the concatenation type of two tuples is made by a Match Type. When a TupleN and another tuple were to be concatenated, the result type uses the *: equivalent. The resulting typeSize is larger and created a lot of false positives in the algorithm of PR #24661

Yes, but from my understanding, the requirement is simply that both TupleN and nested *: pairs be normalized to the same representation. Does it matter whether TupleN is normalized to nested pairs, or vice versa? If we normalize to TupleN, then both types in the unit tests would have size 1, wouldn't they?

Well, I guess for your current implementation of match types termination checks, it's better if (1, 2) and (1, 2, 3) have sizes 2 and 3.

Previously, `typeSize` reported different values for standard `TupleN` types (e.g., `(A, B)`) compared to their equivalent recursive pair encodings (e.g., `A *: B *: EmptyTuple`). This discrepancy occurred because `TupleN` is a flat `AppliedType`, while the nested encoding forms a deeper tree structure. This patch modifies `TypeSizeAccumulator` to canonicalize `TupleN` types into their recursive `*:` representation before calculating the size. This ensures that the size metric is consistent regardless of whether the tuple is represented syntactically or structurally. This change is verified by a new unit test in `TypesTest`, which confirms that both `Tuple3[Int, Boolean, Double]` and its recursive equivalent `Int *: Boolean *: Double *: EmptyTuple` now yield identical `typeSize` values. Fixes scala#24730

Lluc24 changed the title ~~Fix inconsistent typeSize for TupleN vs nested pairs~~ Fix inconsistent typeSize calculation for TupleN vs recursive pair encodings Dec 13, 2025

Lluc24 mentioned this pull request Dec 13, 2025

i22587 Divergence check for Match Types #24661

Draft

Lluc24 force-pushed the i24730-TupleN-wrong-typeSize branch from cd3a120 to dca3ea1 Compare December 13, 2025 14:01

mbovel reviewed Dec 14, 2025

View reviewed changes

Lluc24 force-pushed the i24730-TupleN-wrong-typeSize branch from dca3ea1 to b290509 Compare December 15, 2025 15:14

Gedochao assigned mbovel Dec 16, 2025

mbovel self-requested a review December 16, 2025 17:24

mbovel approved these changes Dec 16, 2025

View reviewed changes

mbovel merged commit 402be90 into scala:main Dec 16, 2025
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix inconsistent `typeSize` calculation for `TupleN` vs recursive pair encodings #24743

Fix inconsistent `typeSize` calculation for `TupleN` vs recursive pair encodings #24743

Lluc24 commented Dec 13, 2025

Uh oh!

mbovel commented Dec 14, 2025

Uh oh!

mbovel Dec 14, 2025

Uh oh!

Lluc24 Dec 15, 2025

Uh oh!

mbovel Dec 15, 2025

Uh oh!

Lluc24 Dec 15, 2025

Uh oh!

mbovel Dec 16, 2025

Uh oh!

mbovel Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		val tpNorm = tp.tryNormalize
		if tpNorm.exists then apply(n, tpNorm)

	/** The `:` equivalent of an instance of a Tuple class /
	def toNestedPairs(using Context): Type =
	tupleElementTypes match
	case Some(types) => TypeOps.nestedPairs(types)
	case None => throw new AssertionError("not a tuple")

Fix inconsistent typeSize calculation for TupleN vs recursive pair encodings #24743

Fix inconsistent typeSize calculation for TupleN vs recursive pair encodings #24743

Conversation

Lluc24 commented Dec 13, 2025

Uh oh!

mbovel commented Dec 14, 2025

Uh oh!

mbovel Dec 14, 2025

Choose a reason for hiding this comment

Uh oh!

Lluc24 Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

mbovel Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

Lluc24 Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

mbovel Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

mbovel Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix inconsistent `typeSize` calculation for `TupleN` vs recursive pair encodings #24743

Fix inconsistent `typeSize` calculation for `TupleN` vs recursive pair encodings #24743